110 research outputs found

    Feasibility of predicting allele specific expression from DNA sequencing using machine learning

    Get PDF
    Allele specific expression (ASE) concerns divergent expression quantity of alternative alleles and is measured by RNA sequencing. Multiple studies show that ASE plays a role in hereditary diseases by modulating penetrance or phenotype severity. However, genome diagnostics is based on DNA sequencing and therefore neglects gene expression regulation such as ASE. To take advantage of ASE in absence of RNA sequencing, it must be predicted using only DNA variation. We have constructed ASE models from BIOS (n = 3432) and GTEx (n = 369) that predict ASE using DNA features. These models are highly reproducible and comprise many different feature types, highlighting the complex regulation that underlies ASE. We applied the BIOS-trained model to population variants in three genes in which ASE plays a clinically relevant role: BRCA2, RET and NF1. This resulted in predicted ASE effects for 27 variants, of which 10 were known pathogenic variants. We demonstrated that ASE can be predicted from DNA features using machine learning. Future efforts may improve sensitivity and translate these models into a new type of genome diagnostic tool that prioritizes candidate pathogenic variants or regulators thereof for follow-up validation by RNA sequencing. All used code and machine learning models are available at GitHub and Zenodo

    CoNVaDING:Single Exon Variation Detection in Targeted NGS Data

    Get PDF
    We have developed a tool for detecting single exon copy number variations (CNVs) in targeted next-generation sequencing data: CoNVaDING (Copy Number Variation Detection In Next-generation sequencing Gene panels). CoNVaDING includes a stringent quality control metric, that excludes or flags low quality exons. Since this quality control shows exactly which exons can be reliably analysed and which exons are in need of an alternative analysis method, CoNVaDING is not only useful for CNV detection in a research setting, but also in clinical diagnostics. During the validation phase, CoNVaDING detected all known CNVs in high quality targets in 320 samples analysed, giving 100% sensitivity and 99.998% specificity for 308,574 exons. CoNVaDING outperforms existing tools by exhibiting a higher sensitivity and specificity and by precisely identifying low-quality samples and regions. This article is protected by copyright. All rights reserved.</p

    Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution

    Get PDF
    We show that epigenome- and transcriptome-wide association studies (EWAS and TWAS) are prone to significant inflation and bias of test statistics, an unrecognized phenomenon introducing spurious findings if left unaddressed. Neither GWAS-based methodology nor state-of-the-art confounder adjustment methods completely remove bias and inflation. We propose a Bayesian method to control bias and inflation in EWAS and TWAS based on estimation of the empirical null distribution. Using simulations and real data, we demonstrate that our method maximizes power while properly controlling the false positive rate. We illustrate the utility of our method in large-scale EWAS and TWAS meta-analyses of age and smoking.</p

    NIPTeR:an R package for fast and accurate trisomy prediction in non-invasive prenatal testing

    Get PDF
    BACKGROUND: Various algorithms have been developed to predict fetal trisomies using cell-free DNA in non-invasive prenatal testing (NIPT). As basis for prediction, a control group of non-trisomy samples is needed. Prediction accuracy is dependent on the characteristics of this group and can be improved by reducing variability between samples and by ensuring the control group is representative for the sample analyzed.RESULTS: NIPTeR is an open-source R Package that enables fast NIPT analysis and simple but flexible workflow creation, including variation reduction, trisomy prediction algorithms and quality control. This broad range of functions allows users to account for variability in NIPT data, calculate control group statistics and predict the presence of trisomies.CONCLUSION: NIPTeR supports laboratories processing next-generation sequencing data for NIPT in assessing data quality and determining whether a fetal trisomy is present. NIPTeR is available under the GNU LGPL v3 license and can be freely downloaded from https://github.com/molgenis/NIPTeR or CRAN.</p

    DNA methylation as a mediator of the association between prenatal adversity and risk factors for metabolic disease in adulthood

    Get PDF
    Although it is assumed that epigenetic mechanisms, such as changes in DNA methylation (DNAm), underlie the relationship between adverse intrauterine conditions and adult metabolic health, evidence from human studies remains scarce. Therefore, we evaluated whether DNAm in whole blood mediated the association between prenatal famine exposure and metabolic health in 422 individuals exposed to famine in utero and 463 (sibling) controls. We implemented a two-step analysis, namely, a genome-wide exploration across 342, 596 cytosine-phosphate-guanine dinucleotides (CpGs) for potential mediators of the association between prenatal famine exposure and adult body mass index (BMI), serum triglycerides (TG), or glucose concentrations, which was followed by formalmediation analysis.DNAm mediated the association of prenatal famine exposure with adult BMI and TG but not with glucose. DNAm at PIM3 (cg09349128), a gene involved in energy metabolism, mediated 13.4% [95% confidence interval (CI), 5 to 28%] of the association between famine exposure and BMI. DNAm at six CpGs, including TXNIP (cg19693031), influencing b cell function, and ABCG1 (cg07397296), affecting lipid metabolism, together mediated 80% (95% CI, 38.5 to 100%) of the association between famine exposure and TG. Analyses restricted to those exposed to famine during early gestation identified additional CpGs mediating the relationship with TG near PFKFB3 (glycolysis) and METTL8 (adipogenesis). DNAm at the CpGs involved was associated with gene expression in an external data set and correlated with DNAm levels in fat depots in additional postmortem data. Our data are consistent with the hypothesis that epigenetic mechanisms mediate the influence of transient adverse environmental factors in early life on long-termmetabolic health. The specific mechanism awaits elucidation.</p

    Mutations in Potassium Channel KCND3 Cause Spinocerebellar Ataxia Type 19

    Get PDF
    OBJECTIVE: To identify the causative gene for the neurodegenerative disorder spinocerebellar ataxia type 19 (SCA19) located on chromosomal region 1p21-q21. METHODS: Exome sequencing was used to identify the causal mutation in a large SCA19 family. We then screened 230 ataxia families for mutations located in the same gene (KCND3, also known as Kv4.3) using high-resolution melting. SCA19 brain autopsy material was evaluated, and in vitro experiments using ectopic expression of wild-type and mutant Kv4.3 were used to study protein localization, stability, and channel activity by patch-clamping. RESULTS: We detected a T352P mutation in the third extracellular loop of the voltage-gated potassium channel KCND3 that cosegregated with the disease phenotype in our original family. We identified 2 more novel missense mutations in the channel pore (M373I) and the S6 transmembrane domain (S390N) in 2 other ataxia families. T352P cerebellar autopsy material showed severe Purkinje cell degeneration, with abnormal intracellular accumulation and reduced protein levels of Kv4.3 in their soma. Ectopic expression of all mutant proteins in HeLa cells revealed retention in the endoplasmic reticulum and enhanced protein instability, in contrast to wild-type Kv4.3 that was localized on the plasma membrane. The regulatory β subunit Kv channel interacting protein 2 was able to rescue the membrane localization and the stability of 2 of the 3 mutant Kv4.3 complexes. However, this either did not restore the channel function of the membrane-located mutant Kv4.3 complexes or restored it only partially. INTERPRETATION: KCND3 mutations cause SCA19 by impaired protein maturation and/or reduced channel function

    Occupational exposure to gases/fumes and mineral dust affect DNA methylation levels of genes regulating expression

    Get PDF
    Many workers are daily exposed to occupational agents like gases/fumes, mineral dust or biological dust, which could induce adverse health effects. Epigenetic mechanisms, such as DNA methylation, have been suggested to play a role. We therefore aimed to identify differentially methylated regions (DMRs) upon occupational exposures in never-smokers and investigated if these DMRs associated with gene expression levels. To determine the effects of occupational exposures independent of smoking, 903 never-smokers of the LifeLines cohort study were included. We performed three genome-wide methylation analyses (Illumina 450 K), one per occupational exposure being gases/fumes, mineral dust and biological dust, using robust linear regression adjusted for appropriate confounders. DMRs were identified using comb-p in Python. Results were validated in the Rotterdam Study (233 never-smokers) and methylation-expression associations were assessed using Biobank-based Integrative Omics Study data (n = 2802). Of the total 21 significant DMRs, 14 DMRs were associated with gases/fumes and 7 with mineral dust. Three of these DMRs were associated with both exposures (RPLP1 and LINC02169 (2x)) and 11 DMRs were located within transcript start sites of gene expression regulating genes. We replicated two DMRs with gases/fumes (VTRNA2-1 and GNAS) and one with mineral dust (CCDC144NL). In addition, nine gases/fumes DMRs and six mineral dust DMRs significantly associated with gene expression levels. Our data suggest that occupational exposures may induce differential methylation of gene expression regulating genes and thereby may induce adverse health effects. Given the millions of workers that are exposed daily to occupational exposures, further studies on this epigenetic mechanism and health outcomes are warranted

    Improved imputation quality of low-frequency and rare variants in European samples using the 'Genome of the Netherlands'

    Get PDF
    Although genome-wide association studies (GWAS) have identified many common variants associated with complex traits, low-frequency and rare variants have not been interrogated in a comprehensive manner. Imputation from dense reference panels, such as the 1000 Genomes Project (1000G), enables testing of ungenotyped variants for association. Here we present the results of imputation using a large, new population-specific panel: the Genome of The Netherlands (GoNL). We benchmarked the performance of the 1000G and GoNL reference sets by comparing imputation genotypes with 'true' genotypes typed on ImmunoChip in three European populations (Dutch, British, and Italian). GoNL showed significant improvement in the imputation quality for rare variants (MAF 0.05-0.5%) compared with 1000G. In Dutch samples, the mean observed Pearson correlation, r 2, increased from 0.61 to 0.71. W
    • …
    corecore